Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3655 | 8 | ၁,၀၀၀ |
4209 | 7 | ၁,၀၀၀/- |
4210 | 7 | ၁၀,၀၀၀ |
8053 | 4 | ၅၀,၀၀၀ |
8192 | 3 | Department,JICA |
11414 | 3 | ၁,၀၀၀/ |
11450 | 3 | ၁၅,၀၀၀ |
11540 | 3 | ၃၀,၀၀၀ |
11897 | 2 | Co,Ltd |
21108 | 2 | ၁၀,၀၀,၀၀၀ |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
1541 | 18 | ၅၀% |
3656 | 8 | ၁၀% |
3659 | 8 | ၁၀၀% |
3676 | 8 | ၂၀% |
4970 | 6 | ၅% |
6104 | 5 | ၂၉% |
7997 | 4 | ၂% |
8020 | 4 | ၂၅% |
11446 | 3 | ၁၄% |
11470 | 3 | ၁၉% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
6195 | 4 | E&P |
8179 | 3 | D&C |
23922 | 1 | M&E |
24628 | 1 | R&D |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
3747 | 7 | Women's |
12327 | 2 | Myanmar's |
16434 | 2 | ဘဘ''''ေအး |
22737 | 1 | D'Youville |
23517 | 1 | Int'L |
24495 | 1 | People's |
25175 | 1 | Taylor's |
25184 | 1 | Telenor's |
25818 | 1 | က'ပဲ |
25819 | 1 | က'ျဖစ္သြားတယ္ေနာ္ |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
11717 | 2 | 3+2 |
19668 | 2 | အာဆီယံ+၃ |
21955 | 1 | 2018-05-16T15:15:00+00:00 |
21956 | 1 | 2018-05-16T15:16:54+00:00 |
21957 | 1 | 2018-05-16T15:52:20+00:00 |
21958 | 1 | 2018-05-18T11:56:02+00:00 |
24391 | 1 | P5+1 |
61795 | 1 | မြန်မာ+အင်္ဂလိပ် |
63225 | 1 | ယ+၂ |
63226 | 1 | ယ+၃ |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
680 | 39 | ၁/၂၀၁၉ |
796 | 33 | ၁/၂၀၁၈ |
1141 | 24 | ၃/၂၀၁၉ |
1199 | 22 | ခရိုင်/ |
1208 | 22 | တိုင်းဒေသကြီး/ |
1237 | 22 | ၂/၂၀၁၉ |
1259 | 21 | တပ္ရင္း/တပ္ဖြဲ႕မ်ားမွ |
1315 | 21 | ၁/၂၀၁၇ |
1537 | 18 | ၁၀၀/- |
1692 | 16 | ရှိ/မရှိ |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words while others are not. If words containing a forbidden character do not have a very low frequency, there may be a problem in preprocessing.
Query for words containing %:
select w_id-100, freq, word from words where w_id > 100 and word like "%\%%" limit 10;
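
The same check can be repeated for every character in the list. The following is a minimal sketch in Python using the standard `sqlite3` module, not the code used to produce this report. The in-memory table, the demo rows (a few entries from the tables in this section, with `w_id` set to rank + 100 as the query implies, plus two hypothetical rows), the helper names `like_pattern`, `words_containing` and `flag_frequent`, and the threshold `min_freq=5` are all assumptions made for illustration. SQLite has no default escape character for `LIKE`, so the sketch adds an explicit `ESCAPE` clause and an `ORDER BY` to make the output deterministic.

```python
import sqlite3

# Special characters listed in the text above.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char):
    """Build a LIKE pattern that matches any word containing `char`.

    `%` and `_` are LIKE wildcards, so they (and the backslash itself)
    are escaped with a backslash to be matched literally.
    """
    if char in ("%", "_", "\\"):
        char = "\\" + char
    return "%" + char + "%"


def words_containing(conn, char, limit=10):
    """Return up to `limit` (rank, freq, word) rows for words containing `char`."""
    query = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(query, (like_pattern(char), limit)).fetchall()


def flag_frequent(rows, min_freq=5):
    """Keep rows whose frequency reaches `min_freq` (a hypothetical threshold)."""
    return [row for row in rows if row[1] >= min_freq]


def build_demo_db():
    """Create a toy in-memory `words` table for demonstration only."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
    rows = [
        (5, "%", 1000),         # hypothetical row with w_id <= 100, skipped by the query
        (150, "Myanmar", 500),  # hypothetical plain word, matches no special character
        (1641, "၅၀%", 18),
        (3756, "၁၀%", 8),
        (3847, "Women's", 7),
        (6295, "E&P", 4),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    for char in SPECIAL_CHARS:
        hits = flag_frequent(words_containing(conn, char))
        if hits:
            print(f"Words containing {char}: {hits}")
```

Which characters count as forbidden, and what frequency counts as "very low", depends on the language and corpus, so the threshold should be chosen per corpus rather than taken from this sketch.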
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots